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1. INTRODUCTION 


1.1. Background 


Turing machines which use a single tape both for their input and as their 
only working space (known as 1TTMs) appear to be extremely slow for 
many problems compared with multi-tape Turing machines and Random 
Access Machines (RAMs). However, apart from a few simple problems 
which require quadratic time on a 1TTM but have simple linear time solu- 
tions on a multi-tape Turing machine, there have been no general results 
proving this appearance to be correct. Indeed when the time for a RAM is 
reckoned by the reasonable logarithmic cost criterion (Aho et al, 1974; 
Cook and Reckhow, 1973), which will be adopted throughout this paper, 
it is not immediately obvious that all problems can be solved as fast on a 
RAM as on a 1TTM. 


1.2. Known Results 


If one looks at machine types apparently more powerful than the 
straightforward RAM, results are already known establishing that these 
machines can simulate a 1TTM (or even possibly a general Turing 
machine) in sub-linear time. Dymond (1982) shows a simulation by a 
parallel RAM in time O(./T) (where T is the time taken by the TM). 
Hopcroft et al. (1975) describe a method of “rectangles” which will enable 
a non-deterministic RAM to simulate a 1TTM in time O(T/log T). Finally, 
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Robson (1984) gives a simulation by a probabilistic RAM in expected time 
O(T/flog log T). 

For a (deterministic sequential) RAM the best known result is the 
Hopcroft et al. (1975) simulation whose time is O(T/log T) under the 
uniform cost criterion but Q(T) under logarithmic cost. 


1.3. New Results 


This paper extends the probabilistic result quoted above to the deter- 
ministic case; that is, it shows a deterministic simulation of a 1TTM whose 
(logarithmic) cost is O(T/log log T). In the probabilistic case it was 
assumed that T>nlogn, where n is the input length (strictly we should 
write T(n) for the time but we abbreviate it to T); this assumption will be 
weakened to T >n log log» with a more careful argument. 

An immediate conclusion is that any language recognised in time T by 
a ITTM is recognised in time T/log log T by a RAM. A second, slightly 
less immediate, conclusion is that there are languages recognised in time T 
by a RAM but not by a 1TTM. 


1.4. Organisation of the Paper 


To clarify the main ideas of the simulation, we begin by presenting, in 
Section 2, a very simple linear time simulation using a new approach. Then 
Sections 3 and 4 describe the new simulation in terms of a transformation 
from this simple one. Section 5 analyses the time complexity of those parts 
of the simulation which are new and Section 6 describes briefly some 
features which are borrowed almost unchanged from a previous simulation 
(Robson, 1984). 

For simplicity we assume that the machine has a binary alphabet. Since 
an arbitrary machine is equivalent to one with a binary alphabet which 
runs slower by only a linear factor, this does not affect the conclusions on 
the order of the simulation time. 


2. A SIMPLE LINEAR TIME SIMULATION 


The tape of the 1TTM is notionally divided into “blocks” whose size is 
b symbols, where b is an even integer approximately equal to (log 7)/4. 
The tape of the Turing machine is represented for the simulation by an 
array TAPE each of whose elements is a b-bit integer representing the 
contents of one block. The size of TAPE is O(T/log” T + n/log T) because 
a 1TTM computation which halts after time T can only have used 
O(T/log T) tape outside the section containing the input. 

The actions of the simulation correspond to “events” where an event is 
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defined as the crossing of a block boundary different from the one crossed 
at the previous event. Thus two events are separated by at least b steps 
giving O(T/log T) events. The description of the configuration is completed 
by three scalar variables, q, Bno, and d: q is the machine’s state at this 
boundary crossing and the boundary crossed is that between blocks Bno 
and Bno + 1 in direction d. 

Between one event and the next, the Turing machine head remains in 
blocks Bno and Bno+1 so that the actions are determined by q, d, 
TAPE[Bno], and TAPE([ Bno +1]. Before the simulation proper, a four 
dimensional array called LOOKUP is established which describes the 
machine’s actions for every quadruple (state, direction, block, block); this 
description is another similar quadruple giving the state and direction of 
the machine at the next event and the resulting configuration in the two 
blocks affected. 

Now the simulation of the step from one event to the next is described 
by the following Pascal statements which are clearly easy to translate into 
RAM instructions (apart from the use of multi-dimensional arrays, for 
which see Section 6). 


move := LOOKUP[q, d, TAPE[ Bno], TAPE[Bno +1]]; 
TAPE[ Bno] := move. Leftblock; 

TAPE[ Bno + 1] := move. Rightblock; 

q := move state; 

d := move.direction; 

if d= Left then Bno := Bno — 1 else Bno := Bno + | 


There are two reasons why this simulation takes time Q(log T) for each 
event and therefore Q(T) overall. On the one hand the elements of array 
TAPE (and therefore also of LOOKUP) have Q(log T) bits and on the 
other hand Bno has Q(log T) bits. The next two sections describe a 
different way of representing the contents of the machine’s tape which 
reduces the cost of accessing it, not at every event but at enough events to 
reduce the average time for these accesses by a factor of log log T. 
Reducing the time caused by the accesses to Bno, which is less interesting 
and less novel, is discussed briefly in Section 6. 


3. PHASES OF THE SIMULATION OF A BLOCK 


The basic idea behind speeding up the simulation is that the current 
contents of a block should be represented by a number of variables, one 
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“large” variable of Q(log T) bits and some smaller ones. At most events 
only the smaller variables are referred to and updated but after about 
log log T events in the block, the large variable also needs to be updated. 
The time spent in the block between two occasions when the large variable 
is used is a “phase” of the simulation of the block. 


DEFINITION. The “superblock” of a given tape block is the section of 
tape of size 26 symbols centred on the block; that is, it consists of the block 
itself and the adjoining b/2 symbols of each neighbouring block. 


DEFINITION. The “time attributed” to a tape block consists of every 
Turing machine step after which the read/write head is within the super- 
block of the block. 


Thus each Turing machine step is attributed to exactly two blocks. 


DEFINITION. Given the superblock surrounding a block at an event, a 
“feasible” block is a block which might arise in the same position at a sub- 
sequent event with at most (b log 5)/(3 log S) steps attributed to the block 
between the two events (where S is the number of states of the Turing 
machine). 


LEMMA. For a given superblock, the number of feasible blocks is b!’ +°% 


The proof of this lemma is deferred to an appendix but a simple argu- 
ment shows its plausibility: to carry log S bits of information into the block 
from outside the superblock, the machine needs b steps (b/2 on the way in 
and again on the way out); thus within the stated number of steps, only 
(log 6)/3 bits can have been carried in. 


DEFINITION. The computation in a block is divided into “phases” where 
the first phase starts at the first event in the block and a phase ends at the 
last event in the block or at the next event which produces a configuration 
in the block which is not a feasible block with respect to the superblock 
which surrounded the block at the start of the phase. 


Since a phase which ends with a non-feasible block arising must include 
at least (b log b)/(3 log S) steps attributed to the block, the total number of 
phases over the whole computation is bounded by 27/(é log 5/3 log S) + 
(number of blocks on the tape) which is O(T/log T log log T). 
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4. A REPRESENTATION OF THE TAPE ALLOWING FAST SIMULATION 


4.1. The Representation 


Now we can be more precise about the large and small variables 
representing a block mentioned at the start of Section 3. The large variable 
for a block contains a 2b bit number giving the superblock’s contents at the 
start of the current phase. The small variables include one which is the 
index of the current block configuration in a list of all feasible blocks for 
this superblock. By the lemma the size of this small variable is O(log b) 
bits. The actual value of the current block configuration could be looked 
up in a two dimensional array FEASIBLE, though this is not necessary for 
the simulation of most events. Thus the relation between the simple simula- 
tion of Section 2 and the new one could be expressed as 


TAPE[i] = FEASIBLE[SUPERBLOCK{], index[i]] 


The representation of the tape is completed by a third variable, 
associated with the boundary between each pair of adjacent blocks; 
BOUNDARY{[i] contains a value dependent on SUPERBLOCK[i] and 
SUPERBLOCK[i+ 1] in a way to be explained in Section 4.2. 


4.2. Simulating Normal Events 


We designate an event as “normal” if it is internal to a phase of each of 
the two blocks on whose boundary it occurs. This term could be regarded 
as inaccurate since there is no guarantee that any normal events occur (if 
the number of steps between events is often very large). At a normal event 
it is necessary to take a quadruple (index, index, state, direction) indicating 
the two blocks and the Turing machine’s behaviour and produce a new 
similar quadruple indicating the changed blocks and the machine’s 
behaviour at the next event. As long as the SUPERBLOCKs of the two 
blocks on each side of this boundary remain the same, this is achieved by 
a particular function from quadruples to quadruples. The BOUNDARY 
value encodes this function, containing the function values for all possible 
arguments packed into a single integer. 

The unpacking of the required quadruple from the boundary value is 
done by lookup in a fixed array SELECT and the components of the 
quadruple are themselves selected by lookup in four more fixed arrays, 
giving as the equivalent of the Pascal statements of Section 2 


move := SELECT[q, d, INDEX[Bno], INDEX[Bno + 1], 
BOUNDARY[Bno]]; 


Leftindex := LIND[ move]; 
Rightindex := RIND[move]; 
if (Leftindex =0) or (Rightindex =0) then Endofphase 
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else begin 
INDEX Bno] := Leftindex; 
INDEX[ Bno + 1] := Rightindex; 
q := STATE[move]; 
d := DIRECTION [move]; 
if d= Left then Bno := Bno — 1 else Bno := Bno + 1 
end 


where an index value of zero has been used to flag a non-feasible block. 


4.3. Updating at an End of Phase 


At the end of a phase the array FEASIBLE is used to find the actual 
current contents of the blocks concerned and then these are used to 
simulate the step to the next event and to update all affected SUPER- 
BLOCK and BOUNDARY values. Two further fixed arrays are used for 
this: 

SB[B1, B2, B3] gives the superblock corresponding to block B2 with B1 
and B3 to its left and right, 

BDRY[SB1, SB2] gives the boundary value for the boundary with 
superblocks SB1 and SB2 to its left and right. 

Thus the Endofphase procedure is 


Lblock := FEASIBLE[SUPERBLOCK[Bno ], INDEX[ Bno] |; 
Rblock := FEASIBLE[ SUPERBLOCK[ Bno + 1], INDEX[ Bno+1]]; 
move := LOOKUP{[g, d, Lblock, Rblock ]; 


{exactly as in Section 2} 
if Leftindex = 0 then 


begin 
Farleft := FEASIBLE[SUPERBLOCK|[ Bno — 1], 
INDEX[Bno — 1]]; 
SUPERBLOCK[ Bno] := SB[ Farleft, move. Leftblock, 
move. Rightblock ]; 


BOUNDARY([Bno — 1] := BDRY[SUPERBLOCK[Bno — 1], 
SUPERBLOCK[Bno]]; 


INDEX| Bno]:=1 
end 
else INDEX[ Bno] := Leftindex; 
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if Rightindex =0 then 
begin 
Farright := FEASIBLE[SUPERBLOCK[Bno + 2], 
INDEX[Bno + 2]; 


SUPERBLOCK[ Bno + 1] := SB[ move. Leftblock, 
move. Rightblock, Farright ]; 


BOUNDARY[Bno + 1] := BDRY[SUPERBLOCK[Bno + 1], 
SUPERBLOCK[Bno + 2]]; 


INDEX[Bno+1]:=1 
end 
else INDEX[Bno + 1 ] := Rightindex; 


BOUNDARY[Bno] := BDRY[SUPERBLOCK[Bno], 
SUPERBLOCK[Bno + 1]]; 


q := move. State; 
d := move. Direction; 
if d= Left then Bno := Bno — 1 else Bno := Bno + 1 


4.4. Setting up the Fixed Arrays 


A number of fixed arrays have been mentioned as being used in 
simulation proper and must be set up in a preliminary stage. They are 
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SELECT([1..S, Left.. Right, 1.. Maxindex, 1..Maxindex, 0..Maxboun- 


dary], 
LIND, RIND, STATE, DIRECTION[0.. Maxquad], 


SB[0..Maxblock, 0.. Maxblock, 0..Maxblock], 
BDRY([0..Maxsuperblock, 0.. Maxsuperblock ], 


FEASIBLE{0..Maxsuperblock, 1.. Maxindex | 
and 
LOOKUP[1..S, Left.. Right, 0.. Maxblock, 0.. Maxblock ], 


where Maxindex, Maxboundary, Maxblock, Maxsuperblock, and Maxquad 
are respectively the largest integers representing an index, a boundary, a 


block, a superblock, and a quadruple (state, direction, index, index): 
Maxblock = 2° —1, 
Maxsuperblock = 27° — 1, 
Maxindex = b'? +0), 
Maxquad = 2SMaxindex* — 1, 
Maxboundary = (Maxquad + 1)Maqued+t _ 1. 
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Of these arrays SELECT, LIND, RIND, STATE, DIRECTION, and SB 
are simply to provide shifting and masking operations which are not in the 
RAM instruction set. These are all easily set up by loops whose structure 
is independent of the details of the Turing machine. 

Each element of LOOKUP is indexed by a (state, direction, block, 
block) quadruple and describes the outcome if the machine crosses the 
boundary between the two blocks in the given state and direction. The 
computation on this pair of blocks is simulated until one of four things 
happens: 


(i) the computation leaves the pair of blocks: the outcome is 
recorded in LOOKUP immediately, 


(ii) the computation recrosses the boundary between the two 
blocks: this establishes a different (state, direction, block, block) quadruple 
which is recorded as the “successor” of the original one, 


(iii) the number of simulated steps exceeds 2°bS indicating looping 
within the block, 


(iv) the computation halts: this fact is recorded together with 
whether the input was accepted. 


Then for any (state, direction, block, block) whose outcome is not yet 
known, the chain of successors is followed until either it loops or a known 
outcome is found and then this outcome is recorded for every member of 
the chain. 

To find the list of FEASIBLE blocks for each superblock, we consider 
every (superblock, direction) pair and simulate S computations entering 
the superblock in this direction (one computation for each state) for up to 
(b log 6)/(3 log S) steps. Any event occurring during this simulation gives a 
feasible block for this pair and this block is recorded as feasible together 
with the time taken to reach it. If the computation leaves the superblock, 
we know that when it returns, the superblock contents will be unchanged 
and the direction will be reversed; this (superblock, direction) pair is 
recorded as a “successor” of the original one together with the time spent 
in the superblock. Then each pair has its list of known feasible blocks 
extended to include the known feasible blocks of its successors provided the 
total time in the superblock to reach them is < (b log b)/(3 log S). This is 
continued until some sweep finds no new information about feasible blocks 
and then the two lists of feasible blocks for each superblock are merged to 
form the appropriate row of FEASIBLE. 

Finally BDRY is set up. For every pair of compatible superblocks 
(“compatible” meaning that the one set up later must contain half of a 
feasible block of the other in its overlap with the central block of the 
other), Maxquad + 1 values are computed and packed together to form the 
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appropriate element of BDRY. Each value is computed by considering 
the actions of the Turing machine after crossing the boundary between the 
blocks of the two superblocks with some quadruple (state, direction index, 
index) describing the machine and the two blocks: 


(a) FEASIBLE gives the actual contents of the two blocks, 
(b) LOOKUP gives the actual result of the computation, 


(c) the indices corresponding to the resulting blocks are found by 
searching in the appropriate rows of FEASIBLE, 


(d) the resulting state, direction, and two indices are packed together 
to give the required value. 


5. TIMING 


5.1. Precomputation 


The first thing the simulating program does is read its input. As the input 
is read, it is packed “on the fly” so that when z bits have been read, they 
are packed into integers of about (log 2)/4 bits. Provided the packing is 
done by the algorithm mentioned in the next section, the time to do this 
is O(n log log n). If T is easy to compute, then the simulation now proceeds 
with the correct value of T; otherwise a succession of values 7; are chosen 
(To is the first power of 2 exceeding n, T,;,,=27;) until the simulation 
assuming T= T, does halt within T, simulated steps. The total time taken 
by the unsuccessful simulations will not exceed twice that taken by the 
successful one. 

Of the fixed arrays which must be set up before the simulation proper, 
those which do not depend on the details of the Turing machine, namely 
SELECT, LIND, RIND, STATE, DIRECTION, and SB, can all be easily 
set up in time 


O(Number of elements x Polynomial(log( Maximum element or subscript ))) 
= O(2” x Polynomial(d)) 
= 0(T/log log T). 


The O(2”) simulations in setting up LOOKUP each take time 
O(2° x Polynomial(b)) and the process of following chains of successors to 
complete LOOKUP takes time O(2”° x Polynomial(b)) since a link from an 
element of a chain to its successor is followed only twice. Thus the total 
time for LOOKUP is again O(2” x Polynomial(6)). 

The O(2”°) simulations in setting up FEASIBLE each take time 
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O(Polynomial(b)) and the process of combining the known feasible blocks 
of each superblock’s successors with its own known feasible blocks must 
terminate after at most (b log b)/(3 log S) iterations giving a total time of 
O(2? x Polynomial(d)). 

BDRY has 2* elements but the only ones ever set up or accessed are 
those whose two subscripts are compatible, effectively reducing the number 
of elements to O(23°° x Polynomial(h)). Since the elements are bounded by 
Maxboundary and 


log Maxboundary = O( Maxquad log Maxquada) 
as O(b?B +e) log b) 
= O( B23 +) 


the time to set up each element of BDRY is O(Polynomial(b)) giving a 
total of O(23°° x Polynomial(b)) = 0( T/log log T). 
Hence the total precomputation time is o( 7/log log T). 


5.2. The Simulation Proper 


As noted in Section 3, the number of phases is O( T/log T log log T) and 
all the variables used in simulating a phase’s start or end have O(log T) 
bits. Thus the total time in these steps is O(T/log log T). Asymptotically 
this will dominate the total simulation time. 

The number of normal events is at most 7/b and all the variables used 
in simulating a normal event, except the block number Bno, have 
0(b?3+*%)) bits, giving a total time for these simulations of 0(7b°)~ 3) = 
o(T/log log T), excluding only the time spent incrementing and decre- 
menting Bno and using it as a subscript for INDEX and BOUNDARY a 
total of O(7/log T) times. 


6. CUTTING THE COST OF MANIPULATING LARGE SUBSCRIPTS 


We now outline briefly the method described fully in (Robson, 1984) of 
reducing the RAM time spent in handling large arrays of relatively small 
elements when references to the array elements show suitable locality. 

Suppose that X is an array of O(T) elements, each of O(log! ‘T) bits, 
and that references to X are all via a subscript i which is only updated by 
incrementing or decrementing it by 1. Then a different representation of X 
reduces the average cost of an operation on / or a reference to X from 
O(log T) to O(log '~*T log log T). 

Elements of X are grouped into “pages” of p = Q(log ‘T) so that the total 
number of bits in a page is approximately (log T)/2. Two of these pages are 
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called “current,” namely those on each side of the last page boundary 
crossed by the value of the subscript i The elements of ¥ within these 
current pages are kept in a small array of O(log T) elements so that they 
are accessed fast. Each other page has all its elements packed together into 
a single integer and stored in a larger array indexed by page number. The 
variable i is replaced by three variables, namely pagestart, the index (in X) 
of the first element of X within the current pages, rem = i — pagestart, and 
pageno = pagestart/p. 

Now the time to access elements of X within the current pages is 
O(log '~*T) but when a new page becomes current, elements of X have to 
be packed and unpacked. This packing and unpacking can be done in time 
O(log T log log T) by a divide and conquer method and occasions when it 
is necessary are separated by O(log “T) operations on i. Thus the average 
time for an operation on / or a reference to X[i] is O(log’ ~*T log log T). 

This approach can be used to handle the arrays BOUNDARY and 
INDEX and the variable Bno used to index them. The value of Bno is still 
available for use in other ways as pagestart + rem. 

Details of this approach and of the packing and unpacking algorithms 
can be found in (Robson, 1984). That paper also describes one method 
of handling multi-dimensional arrays by address calculation; arrays of 
pointers to lower dimensional arrays would be equally satisfactory. 


7. CONCLUSIONS 


From all the previous discussion it is clear that the whole simulation 
time is O(n log log n + T/log log T). Thus we have immediately 


THEOREM 1. Any language recognised in time T(n) by a single tape 
Turing machine is recognised by a Random Access Machine in time (under 
the logarithmic cost criterion) O(max(T(n)/log log T(n), n log log n)). 

If the Turing machine had a read only input tape and a separate work 
tape, then a similar simulation could be done incorporating into the “state” 
of the simulated machine an indication of the position of the input tape 
head. Since the simulation time was proportional to the logarithm of the 
number of states, this would give a simulation in time O(max(T(n) 
log n/log log T(n), nlogtogz)), an improvement only if T is at least 
exponential. 

As a fairly simple consequence we have also 


THEOREM 2. If T,(n)=Q(nloglogn) is computable by a (log cost) 
RAM in time O(T,(n)) and T,(n) = 0(T,(n) log log T,(n)), then there is a 
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language recognised by a RAM in time O(T,(n)) but not recognised by any 
single tape Turing machine in time O(T,(n)). 


Proof. By (Cook and Reckhow, 1973) there exists a language 
recognised by a RAM in time O(T,(”)) but not in time O(max(T,(n)/ 
log log T,(n), n log log n)). This language cannot be recognised by a 1TTM 
in time O(T,(n)) because the simulation would give a RAM recogniser 
with time O(max(T>(n)/log log T,(n), n log log n). 


The conclusion of Theorem 2 holds also for T, between n and n log log n, 
but for quite different reasons: by the result of (Trakhtenbrot, 1964) the 
languages recognised by a single tape Turing machine in time O(T,(n)) are 
also so recognised in linear time and so are regular; but the languages 
recognised by a RAM in time O(7,(n)) include all the regular languages 
and also {a’b'}. 

The questions of whether there is a sublinear time simulation of 1TTM 
by TM and/or of TM by RAM and more weakly whether the increase in 
the set of languages recognised comes in the step from 1TTM to TM or 
from TM to RAM (or both) are all still open and full of interest. 


APPENDIX 


LEMMA. Given a section of Turing machine tape consisting of 2b cells 
numbered 1 to 2b, we define an “event” to be an occasion when the 
machine crosses one of the boundaries of the “central block” of the section, 
(namely the section from cells b/2+1 to 35/2 inclusive) unless, since the 
last event, the machine has neither crossed out of the section nor traversed 
the central block. Given the configuration of the whole section at a time to 
at which an event occurs, a “feasible block” is a configuration which could 
occur in the central block at the time of a subsequent event at a time from 
fo up to the completion of (b log 6)/(3 log S) computation steps within the 
section. 

The number of feasible blocks for a given configuration at fy is b'P +). 


Proof. Suppose the number of feasible blocks is f. We consider f 
computations, one producing each feasible block. For each i (1 <i< 6/2), 
we consider the four boundaries L, (defined as the boundary between 
cells i and i+1), X, (b/2 +i and b/2+i+1), Y;(b+i and b+i+1) and 
R, (36/2+i and 36/2+i+1). The f computations contain on average at 
least 2 log f/log S— O(1) steps which cross one of these four boundaries 
(otherwise two of the f computations would be identical in their odd 
numbered positions and since every crossing into the area between L, and 
R, occurs at an odd numbered position, these two computations would 
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produce identical results in this area contradicting the supposition that 
they produce different configurations between 6/2 and 36/2). 

Now summing over i from 1 to 5/2, we conclude that the average 
number of steps within the section for the f computations is at least 
b(log f/log S— O(1)), but this average cannot exceed (b log b)/(3 log $). 

Hence b(log f/log S— O(1)) < (b/log 6)/(3 log S) 

giving log f < log b/3 + O(1) and finally f= b'? 7°. 
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